2024-12-25 07:59:08.AIbase.14.2k
Alibaba Tongyi Qwen Open Source Visual Reasoning Model QVQ-72B-Preview
The Qwen team recently announced the open-source release of their latest multimodal reasoning model, QVQ, marking an important step forward in the capabilities of artificial intelligence in visual understanding and complex problem-solving. This model is based on Qwen2-VL-72B and aims to enhance AI reasoning capabilities by combining language and visual information. In the MMMU evaluation, QVQ achieved a high score of 70.3, demonstrating significant performance improvements over Qwen2-VL-72B-Instruct in several math-related benchmark tests.